FPGA-Based Prototype of Nexus++ Task Manager
Abstract
StarSs is one of several programming models that try to ease parallel programming. In StarSs, the programmer identifies pieces of code that can be executed as tasks, along with their inputs and outputs. The runtime system (RTS) then determines the dependencies between tasks and schedules ready tasks onto worker cores. Previous work has shown, however, that the StarSs RTS may become a bottleneck that limits the scalability of the system, and proposed a hardware task management system called Nexus++ to eliminate this bottleneck. The first prototype of Nexus++ was implemented in SystemC. Its architecture also had a nondeterministic multi-cycle search algorithm in its critical path, potentially limiting its scalability. In this paper, we improved the architecture of Nexus++ and employed multi-way set-associative cache-like data structures to optimize its search algorithm and increase task throughput. We also modeled the new architecture in VHDL and targeted a Virtex 5 FPGA from Xilinx. Experimental results show that the new architecture is very resource-efficient, utilizing only 19% of the target FPGA. They also show that Nexus++ achieves a speedup of up to 81× on synthetic benchmarks modeled after H.264 decoding. Hence, Nexus++ significantly enhances the scalability of applications parallelized using StarSs.
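To illustrate why a multi-way set-associative structure bounds search time, here is a minimal Python sketch. It is not the Nexus++ design: the class name, the modulo index function, the table sizes, and the "reject when the set is full" policy are all assumptions made for illustration, since the abstract does not describe what the hardware stores or how it resolves conflicts. The point is only that a lookup inspects at most `ways` entries, however full the table is.

```python
class SetAssociativeTable:
    """Toy N-way set-associative table keyed by an integer address."""

    def __init__(self, num_sets=4, ways=2):
        self.num_sets = num_sets
        self.ways = ways
        # Each set holds at most `ways` (address, value) pairs.
        self.sets = [[] for _ in range(num_sets)]

    def _index(self, address):
        # Hypothetical index function: plain modulo on the address.
        return address % self.num_sets

    def lookup(self, address):
        # At most `ways` comparisons, independent of table occupancy.
        for addr, value in self.sets[self._index(address)]:
            if addr == address:
                return value
        return None

    def insert(self, address, value):
        """Return True on success, False if the target set is full."""
        entries = self.sets[self._index(address)]
        for i, (addr, _) in enumerate(entries):
            if addr == address:
                entries[i] = (address, value)  # update existing entry
                return True
        if len(entries) >= self.ways:
            return False  # conflict: assumed policy is to reject
        entries.append((address, value))
        return True


table = SetAssociativeTable(num_sets=4, ways=2)
first = table.insert(0x10, "task A")   # 0x10 % 4 == 0
second = table.insert(0x14, "task B")  # 0x14 % 4 == 0, same set
third = table.insert(0x18, "task C")   # same set again, now full
found = table.lookup(0x14)
missing = table.lookup(0x18)
```

In hardware the ways of a set would typically be compared in parallel, so the bounded loop here stands in for a fixed-latency lookup.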
Similar resources
FPGA-Based Prototype of the Task Superscalar Architecture
In this paper, we present the first hardware implementation of a prototype of the Task Superscalar architecture; an experimental task-based dataflow scheduler that dynamically detects inter-task data dependencies, identifies task-level parallelism, and executes tasks out-of-order. The implemented hardware is based on a distributed design that can operate in parallel and is easily scalable to manage...
Implementation of a Cooperative Agent Architecture Based on the Language-Action Perspective
In this paper the architecture and implementation of Cooperative Information Agents (CIA) is described. Taking a language-action perspective to the design of CIAs allows for the specification of obligations and authorizations, and results in the separation of tasks (things the agent must do) and contracts (mutually agreed commitments to the course of communication). The architecture describes th...
Performance Evaluation of a Firewall-Compliant Globus-based Wide-Area Cluster System
In this paper, we present a performance evaluation of a wide-area cluster system based on a firewall-enabled Globus metacomputing toolkit. In order to establish communication links beyond the firewall, we have designed and implemented a resource manager called RMF (Resource Manager beyond the Firewall) and the Nexus Proxy, which relays TCP communication links beyond the firewall. In order to extend th...
Cooperative Agent Architecture based on the Language-Action Perspective
In this paper the architecture and implementation of Cooperative Information Agents (CIA) is described. Taking a language-action perspective to the design of CIAs allows for the specification of obligations and authorizations, and results in the separation of tasks (things the agent must do) and contracts (mutually agreed commitments to the course of communication). The architecture describes t...
Resource Manager for Globus-Based Wide-Area Cluster Computing
In this paper, we present a new type of Globus resource allocation manager (GRAM) called RMF (Resource Manager beyond the Firewall) for wide-area cluster computing. RMF manages computing resources such as cluster systems and enables utilization of them beyond the firewall in global computing environments. RMF consists of two basic modules, a remote job queuing system (Q system) and a resource allo...